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This study of prefilters and sampling rates analyzes the edge busyness 
phenomenon which is a very significant degradation introduced by intra- 
frame video coders. For any given transmission rate, the prefilter and 
sampling rate can be varied to yield a tradeoff between the edge busyness 
and the rise time associated with a video transition. A narrower prefilter 
results in less edge busyness but slower rise times and vice versa. Similarly, 
varying the sampling rate implies an inverse variation in the number of bits 
per sample allowed in the quantizer, which leads to a tradeoff between alias- 
ing and quantizing noise, both of which are components of the edge busy- 
ness phenomenon. This paper shows in detail how the prefilter, sampler, 
and quantizer affect both edge busyness and rise time, and optimizes the 
intraframe coder design as a function of transmission rate from an analysis 
of this two-factor tradeoff. 

I. INTRODUCTION 

The goal of this paper is to study the effects of filtering and sampling 
in the digital processing of signals for Picturephone® service. In doing 
this we are ultimately interested in choosing the optimum prefilters 
and sampling rates (and therefore the number of bits per sample used 
in the quantizer) for intraframe DPCM codecs. To simplify the analy- 
sis it is assumed that the edge busyness phenomenon is the most sig- 
nificant degradation introduced by these codecs. It will be shown how 
this edge busyness is affected by each part of the codec and how it can 
be controlled by trading it off against picture resolution. The main 
theme of this study is this tradeoff between the detrimental effects of 
both edge busyness and degraded rise times. Both factors are associ- 
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ated with video edges and hence only edges, modeled as step functions, 
will be considered. 

Section II is a brief description of the analog Picture-phone system, 
which outlines relevant short-haul system specifications. Section III 
defines edge busyness and shows why it is such a significant video im- 
pairment ; it also describes how edge busyness is affected by each part 
of the codec and how it can be controlled by trading it against picture 
resolution. This edge busyness— resolution tradeoff is further investi- 
gated in Section IV which uses a series of subjective pair comparison 
tests to determine the optimum tradeoff as a function of system quality. 
Section V uses the results of Sections III and IV to show that optimum 
coder composition, in terms of prefiltering, sampling rate, and quan- 
tizer structure, is a function of the digital transmission rate. This study 
is concluded in Section VI with a subjective testing program which 
substantiates the validity of the edge busyness— rise time evaluation of 
video coders and rates a number of intraframe coders on a five-com- 
ment impairment scale. 

II. ANALOG SYSTEM DESCRIPTION 

For the purposes of this study, the important parts of the analog 
Picture-phone system can be modeled as illustrated in Fig. 1. Here both 
the camera system and all analog links connecting the transmitter to 
the receiver are nominally flat to 1 MHz. If all the frequency shaping 
in the receiver (excluding de-emphasis) is combined into one equivalent 
rolloff filter, the resulting filter has a crisp ened Gaussian response that 
rolls off to - 20 dB at 1 MHz. Hence the total optics-to-optics step re- 
sponse of the analog Picturephone system is dominated by that of the 
receiver station set.* This total optics-to-optics rolloff and overshoot 
characteristic is given by : f 

W/)l 2 = {[1 + K(f/TY1 exp [-0.5(//T) s ]} J (1) 

where / is in MHz and : 

K = 0.5292 

T = 0.35592. 

From this the step response can be shown to be (assuming linear phase 
with a slope of to) : 

s(t) = i Erf (B) + exp ( - B 2 ) (2) 

v¥ 



* See Ref. 1, pp. 291-292. 

f See Ref. 2, eq. (8), but add a set of missing parentheses. 
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Fig. 1 — Analog system model. 



B = y/2wT(t - to) X 10 f 



This step response is plotted in Fig. 2. The response is symmetric about 
the half-amplitude point with a 4-percent single undershoot and over- 
shoot and has a 10- to 90-percent rise time (T R ) of 0.69 /*s. 

III. DIGITAL SYSTEM DESCRIPTION 

The essential parts of the intraframe DPCM digital system model 
are illustrated in Fig. 3. In addition to the analog system already de- 
scribed, there is a prefilter, and there are sampling circuits and a quan- 
tizer-decoder combination. The prefilter limits line interference and 
shapes the analog video signal to be encoded. The sample-and-hold and 
sampling clock circuits turn the analog video into a discrete time for- 
mat. The quantizer is used to classify the sample-and-hold outputs into 
a finite number of values for transmission over a given digital channel. 
The decoder is the inverse of the DPCM quantizing algorithm and 




TIME IN (J.S 

Fig. 2 — Analog system step response. 
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Fig. 3 — Digital system model. 



puts out the quantized version of the coder's sample-and-hold wave- 
form to the receiving analog system. 

In order to see what the codec does to a video transition (such as the 
edge of a face or a shirt or any black- white boundary), let us start with 
the simplest part of the codec and study its effect on the video signal. 
Then the remaining parts of the codec can be added one at a time and 
analyzed. All step responses will be measured at the output of the 
analog receiver, as would be seen by the user. 

3.1 Sampling Circuits 

Consider the simplest video codec consisting of only a sample-and- 
hold circuit driven by a sampling clock. Hence the prefilter, quantizer, 
and decoder circuits of Fig. 3 are deleted. If we model the ouput of the 
Picture-phone transmitter as a perfect step function (the validity of this 
model will be discussed in the next section), the input to the codec 
sample-and-hold circuit is illustrated in Fig. 4a. If the sampling instants 
of the coder clock are given by sampling Phase 1 in Fig. 4a, then the 
outputs of the sample-and-hold circuit and the receiver rolloff filter are 
those designated Phase 1 in Figs. 4b and 4c, respectively. If the phasing 
of the sampling clock with respect to the step function was changed by 
a quarter of the sampling period, the sampling instants could be given 
by sampling Phase 2 of Fig. 4a. The resulting outputs of the sample- 
and-hold circuit and the receiver rolloff filter would be given by the 
Phase 2 curves in Figs. 4b and 4c, respectively. The outputs from this 
second phasing of the sampling clock would be the same as those of the 
first phasing delayed by T/4. If the sampling clock were delayed as 
given by sampling Phase 3 and 4, the corresponding outputs would be 
given by the Phase 3 and 4 waveforms in Figs. 4b and 4c, respectively. 
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In this way we can see that the step response of this digital system 
(always observed at the output of the receiver rolloff filter) will always 
have the same shape as that of the analog system. The time of occur- 
rence of the step response will depend on the relative phasing of the 
sampling clock with respect to the camera step function. This indeter- 
minacy of the edge location due to the sampling phase dependence is 
one specific example of aliasing and will henceforth be called sampling- 
induced busyness (SIB). Quantitatively, this sampling-induced busy- 
ness will be measured as the maximum indeterminacy of the edge at 
the 50-percent amplitude point. This measure is illustrated in Fig. 4c 
and in this case is equal to T, one sampling interval. 

Sampling-induced busyness is detrimental because it causes the 
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Fig. 4 — (a) Step input from camera, (b) Decoder output, (c) Rolloff filter output. 



502 



THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1973 



breakup of edges and is easily seen in a typical Picture-phone display. To 
see how this breakup occurs, turn to Fig. 5. This represents an enlarge- 
ment of a small section of a Picturephone display. Only five (LI to L5) 
of the 267 lines in a frame are illustrated, and only four sampling in- 
stants (SI to S4) are shown. The main diagonal represents the edge of 
a black-white transition. In the original scene all the area to the left of 
the edge is black and all area to the right of the edge is white. On line 
LI, the black- white transition is detected by the sampler at sampling 
instant S4. On line L2, the transition is also detected at sampling in- 
stant S4. It is detected at S3 on line L3 and at S2 on both lines L4 and 
L5. Thus the transition is detected at the times indicated by circles on 
Fig. 5. After these sample-and-hold waveforms are passed through the 
receiver rolloff filter, the edge of the black-white transition would ap- 
pear to the observer to be the crooked dashed curve of Fig. 5. 

For a codec whose sampling clock is locked to the raster, the 
above curvature is motionless if the edge is still. If the codec is not 
locked to the raster or if the edge is moving from frame to frame, the 
above curvature will crawl through the picture. A straight line will be 
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Fig. 5 — Sampling-induced busynesB. 
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turned into a moving crooked edge. This effect is extremely visible 
and quite annoying for an unfiltered sampled-and-held picture. 

The only effect of introducing the sampling circuits into the analog 
system is this edge indeterminance we have called sampling-induced 
busyness. The rise time of the system has not been affected by the 
sampling circuits. 

3.2 Prefilter 

The addition of the prefilter changes the input to the sample-and- 
hold circuit from a perfect step function to a gentler transition with 
a nonzero 10- to 90-percent rise time as might be given in Fig. 6a. The 
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Fig. 6 — Effect of prefilter on SIB. (a) Prefilter output, (b) Sample-and-hold output, 
(c) Picturephone® receiver output. 
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Fig. 7 — Sampling-induced busyness — rise time tradeoff. 

more prefiltering, the gentler the transition. The effect that this pre- 
filter has on the decoder step response is illustrated in Fig. 6b for four 
particular sampling clock phasings (in particular those phasings given 
by t = 0, T/4, T/2, and 3/!F4 on the time scale of Fig. 6a). As opposed 
to the no prefilter case where the step functions are simply delayed in 
time (see Fig. 4b), the addition of the prefilter results in an overlapping 
of the decoder step responses. After these decoder outputs are passed 
through the receiver rolloff filter,* the sharp corners will be smoothed 
and the width of the step response envelope will be reduced from the no 
prefilter case, as can be seen by comparing Fig. 6c with Fig. 4c. Thus 
sampling-induced busyness can be reduced at the expense of degraded 
rise times. The exact tradeoff between sampling-induced busyness 
(SIB) and average rise time (T R ) is given in Fig. 7 for a Gaussian- 

* See the Appendix for an outline of the computer program used to derive system 
step responses for any combination of transition step size, prefilter step response, 
sampling rate, and quantizer. 
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shaped prefilter. The subjective weighting between these two factors is 
dependent upon picture content and will be discussed in Section IV 
after the quantizer effects are added. 

The curve of Fig. 7 corresponds to a fixed sampling rate. Higher 
sampling rates will shift the curve closer to the analog case, and lower 
sampling rates will shift the curve further away from the analog case. 
This effect is illustrated in Fig. 8 for no prefiltering and for a Gaussian 
prefilter 6 dB down at 1 MHz. The higher the sampling rate, the 
closer the knee of the tradeoff curve approaches the analog condition 
of no busyness and a 0.69-/xs rise time. Other families of prefilter shapes 
(non-Gaussian) produce slightly different SIB- 7^ curves, but all ex- 
hibit the same tradeoff, reducing one only at the expense of increasing 
the other. 

At this point it is necessary to discuss the validity of modeling the 
output of the Picture-phone camera system as an ideal step function. 
The camera frequency characteristic is nominally flat to 1 MHz; 
therefore, as explained in Section II, the optics-to-optics step response 
is governed by the rolloff filter that is 20 dB down at 1 MHz and the 
ideal step function is a good model for the analog system. Since there 
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is a filter in the front end of the codec, the step-function model is also 
appropriate for the digital case as long as the frequency characteristic 
of the prefilter dominates that of the camera system. All prefiltering 
that will be seriously studied will be from 6 to 40 dB down at 1 MHz 
and will indeed dominate the camera system. In some instances, re- 
sults will be given for less prefiltering (as in the case of points and 0.1 
in Fig. 7) and then it should be realized that these are somewhat 
idealized cases used only for illustrative purposes. 

3.3 Quantizer 

The addition of the quantizer-decoder combination completes the 
digital system given in Fig. 3. The quantizer is required in order to 
transmit over a finite digital channel. This necessary addition further 
confuses the edge indeterminacy — rise time picture given by the pre- 
filtering and sampling circuits. The quantizer both produces its own 
edge indeterminance and affects average rise times. To see how a quan- 
tizer can produce edge indeterminance, refer to Fig. 9a. Here the step 
response of the prefilter is given by a smooth straight line. The quan- 
tizer used in this example has the standard Phase 0, companded DPCM 
3-bit characteristic described in Ref. 3. With the sampling instants 
given at the bottom of Fig. 9a, the prefilter output could be coded as 
either decoder output No. 1 or No. 2, depending on slight noise varia- 
tions at the beginning of the step. After passing through the receiver 
rollofT filter, the response to decoder outputs No. 1 and No. 2 is given 
in Fig. 9b by rolloff filter outputs No. 1 and No. 2 respectively. The 
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Fig. 10 — Step response envelopes. 

resulting edge indeterminance (again measured at the 50-percent am- 
plitude point) is due to the quantizer and will henceforth be called 
quantizer-induced busyness (QIB). This quantizer-induced busyness 
is any resulting edge indeterminance over and above that which would 
be produced by just the prefiltering and sampling circuits. The total 
edge indeterminance resulting from a combination of both sampling- 
and quantizer-induced busyness will be referred to as edge busyness 
(EB). In all cases the edge busyness effect is calculated by varying the 
phase of the sampler. This phase difference can result in practice in 
many ways. It occurs spatially for a slanted line. It can result from 
a moving scene, from an unsynchronized codec, or from the residual 
phase jitter in a synchronized codec. It can also be shown that to some 
extent random noise on the video signal can be mapped into an equiva- 
lent sampling phase jitter. 

The envelopes of possible step responses at the output of the receiver 
rolloff filter for the Phase 1 system are given in Fig. 10 for a Gaussian 
prefilter that is 0.1, 6, and 12 dB down at 1 MHz (see the Appendix for 
calculations). This Phase 1 system employs a 1. 856-MHz sampler and 
a companded DPCM 4-bit quantizer which is described in detail in 
Table I. Again the prefilter yields a tradeoff between edge busyness 
and rise time. For any combination of sampling rate and quantizer, 
the prefilter yields this same type of tradeoff. To assign numbers to this 
characteristic, Fig. 11 plots edge busyness versus rise time for the 
Phase 1 system with varying amounts of Gaussian prefiltering. Also 
plotted for comparison purposes is the sampling-induced busyness — 
rise time tradeoff already given in Fig. 7. 
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Table I — Phase 1, 4-Bit Quantizer Characteristic 



Output Level Number 



Quantized Signal Output* ( 2g J 



1, 


2 


3, 


4 


5, 


6 


7, 


8 


9, 


10 


11, 


12 


13, 


14 


15, 


16 



± 1 

± 3 
± 7 
±15 
±23 
±31 
±39 
±47 



* Weights are relative to the smallest quantum 9tep of a 7-bit PCM system. See 
Ref. 4 for additional information. 



The addition of the quantizer complicates the problem even more 
than already discussed. Up to this section the results have been inde- 
pendent of the magnitude of the black- white transition. Fig. 11 results 
from using a 40-percent black-white transition with the Phase 1 4-bit 
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Fig. 11 — Edge busyness — rise time tradeoffs. 
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Fig. 12 — Edge busyness and rise time vs step size. 



quantizer. Both edge busyness and rise time are plotted versus step 
size in Fig. 12 for the Phase 1 coder using its 6-dB Gaussian prefilter. 
The shape of both curves can be explained using Fig. 13, where both 
the quantizing noise and the step-size-to-quantizing-noise ratio are 
plotted versus step size. Both edge busyness and rise time are inversely 
proportional to the signal-to-noise ratio and are therefore minimized at 
a step size of 30 to 40 percent where the signal-to-noise ratio is maxi- 
mized. For the remainder of this study only three measures of system 
quality will be considered. These three measures are : 

(i) the edge busyness and rise time of the 40-percent step size 

(M4), 
(ii) the average edge busyness and average rise time for the 10, 20, 

30, and 40 percent step sizes (Ml-4), 
(in) the average edge busyness and average rise time for the 10 step 
sizes from 10 to 100 percent (Ml-10). 
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Fig. 13 — Quantizer noise and SNR vs step size. 



Measure M4 has the advantage of simplicity. Edge busyness can 
easily be seen on a transition of this size. This is also the breakpoint 
shown in Fig. 13 before the quantizer's slope overload characteristic 
takes effect. Measure Ml-4 averages over the lower four step sizes. 
Typically 95 to 99 percent of all transitions are below a 40-percent step 
size for a 2-MHz sampler. Measure Ml-10 uses the most information 
and disregards all assumed knowledge of observer subjective effects. 
All three measures were used in designing the coders that will be dis- 
cussed in the following sections. Ultimate justification for these mea- 
sures rests with the subjective evaluation of coders with quantizers as 
will be discussed in Section VI. 
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IV. SUBJECTIVE EVALUATION OF THE EDGE BUSYNESS — RISE TIME TRADE- 
OFF 

In the last section it was shown how the variation of the prefilter 
yielded a tradeoff between edge busyness and average rise time for any 
combination of sampling rate and quantizer structure. In this section 
a series of subjective tests will be described that were run to find the 
subjectively optimum edge busyness — rise time tradeoff. For these tests 
only sample-and-hold circuits were used in order to avoid the question 
of which measure to use when quantizers are involved. In Section 
VI a series of subjective tests will be described using coders with 
quantizers. 

The test consisted of asking each of 30 observers to rate various sys- 
tems on the basis of three different source pictures. The first camera 
source was a mannequin, set up as shown in Fig. 14. This mannequin, 




Fig. 14 — Microhenry. 
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"Microhenry" or "Mike" for short, is mounted on a platform which 
tilts him in a symmetric, sinusoidal fashion from side to side. He is 
pivoted approximately 14 inches below his chin and was moved a 
maximum of ± 13 degrees from the vertical at a rate of one cycle every 
4.3 seconds. The second and third camera sources were slides of Karen 
and a graph, as shown in Figs. 15 and 16, respectively. Both of these 
sources were stationary. For all three sources, each of the 30 observers 
was given an AB type test wherein the observer was presented with 36 
different pairs of systems and asked to pick the most pleasing or the 
most readable of each pair (most pleasing for each test given on Mike 
and Karen and most readable for those tests given on the graph) . These 
36 pairs include all meaningful comparisons between nine different sys- 
tems. If the pair (A, B) = (system 6, system 9) was given, then pair 
(A, B) = (system 9, system 6) would not be included. Similarly there 
was no AB of any system with itself. Each of these nine systems con- 
sisted only of a prefilter followed by a sample-and-hold circuit. All pre- 
filters had a crispened Gaussian amplitude characteristic which, along 




Fig. 15 — Karen. 
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with near-linear phase, resulted in a symmetric step response with 
single 4-percent undershoot and overshoot. Five different prefilters 
were used, being designed to be 3, 6, 10, 20, and 40 dB down at 1 MHz. 
After tuning, these filters were measured to be 3.0, 5.4, 9.6, 18.4, and 
38. 1 dB down at 1 MHz with approximately linear phase. The sample- 
and-hold circuit was driven by one of three sampling clocks, each of 
which was synchronized to the camera video. The three sampling 
clocks had synchronized frequencies of 2.015, 1.512, and 1.200 MHz. 
The particular numbering sequence with system characteristics is 
listed in Table II. Both the predicted and the measured performance 
characteristics of these nine systems, as specified on an edge busyness — 
rise time chart, are given in Fig. 17. Each test was given using an 
unmodified Mod 2C Picturephone station set as the receiver, with 
the observer given complete freedom of its brightness control. Process- 
ing the resulting data for each camera source using standard-pair com- 
parison techniques 5 results in the scale values plotted in Fig. 18. These 
scale values are a psychological measure of relative coder preference; 
the higher the scale value, the more the coder is preferred. The least 
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Table II — System Characteristics 



System Number 


Sampling Rate 


Prefilter* 


1 


2.015 MHz 


6 


2 
3 
4 
5 


1.512 MHz 
1.512 MHz 
1.512 MHz 
1.512 MHz 


3 
10 
20 
40 


6 

7 
8 
9 


1.200 MHz 
1.200 MHz 
1.200 MHz 
1.200 MHz 


3 
10 
20 
40 



* Specified by the number of dB down at 1 MHz. 
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Fig. 17 — Edge busyness — rise time tradeoffs for subjective tests. 
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Fig. 18 — Psychological scale values for Mike, Karen, and graph. 



preferred coder is arbitrarily assigned the value zero with all other 
coders scaled relative to the worst. The following properties should be 
noted : 

(i) No intersource numerical comparisons should be made. A scale 
value of 4.13 for the best graphics coder and 3.13 for the best 
coder on Mike only means that the observers were more dis- 
criminating on the graphics source. It does not mean that the 
best graphics coding is better than the best coding of Mike, 
(u) Mike is somewhat of a worst case for aliasing. His shirt was 
chosen for its narrow stripes because undersampling is most 
harmful when appreciable high-frequency content is present. 
This undersampling causes the straight edges to appear crooked 
and the moving subject imparts a distinctive motion to them. 
On Mike most observers found aliasing to be the only degrada- 
tion caused by the different systems and hence the narrower 
prefilters were preferred because they yielded less aliasing at 
the expense of lower picture resolution. A prefilter that is 20 
dB down at 1 MHz was preferred for the 1.5-MHz sampling 
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rate and a prefilter 40 dB down was preferred for the 1.2-MHz 
system. 

(Hi) Karen has been a much-used source where both edge busyness 
and resolution are important. The aliasing still shows up as 
crooked stripes in her blouse, but they are motionless for our 
synchronized clocks and therefore not quite as annoying. For 
this source a 20-dB prefilter was preferred for the 1.5-MHz sys- 
tem and a 3-dB prefilter was preferred for the 1.2-MHz system. 

(iv) The graph's sole property is readability. Here the 10-dB pre- 
filter was preferred for the 1.5-MHz system and the 20-dB 
prefilter was preferred for the 1.2-MHz system. 

Since Mike is a worst case for aliasing, Karen a difficult source for 
both aliasing and resolution, and the graph a test of readability, the 
three sources are considered to be of equal importance, and all 90 re- 
sponses will be lumped together for the final preference scale. Analyz- 
ing these 90 responses by the pair comparison analysis 5 results in the 
scale values shown in Fig. 19. The 2-MHz system is preferred over the 
best 1.5-MHz system which is in turn preferred over the best 1.2-MHz 
system. There is a fairly broad range of optimum prefiltering for the 
1.5-MHz system with the best prefilter 20 dB down at 1 MHz. For 
the 1.2-MHz system, the narrower the prefilter the better, with the 
best choice of those studied being 40 dB down at 1 MHz. 

As shown in Fig. 17, these results indicate that the best subjective 
tradeoff between edge busyness and rise time occurs just below the knee 
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Fig. 19 — Psychological scale values resulting from all 90 responses. 
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Fig. 20 — Edge busyness — rise time tradeoffs for the Phase 1 quantizer. 

of each tradeoff curve where the incremental reduction in edge busyness 
becomes small compared to the incremental increase in average rise 
time. 



V. OPTIMIZING THE SAMPLING RATE FOR A FIXED TRANSMISSION RATE 

For the Phase 1 4-bit companded DPCM quantizer given in Table 
I, the variation of both prefilter and sampling rate leads to the trade- 
offs between edge busyness and average rise time given in Fig. 20. These 
curves are not unexpected ; for any combination of sampling rate and 
quantizer, the prefilter leads to a tradeoff between edge busyness and 
average rise time. The higher the sampling rate (and therefore the 
higher the transmission rate in this case), the closer the knee of 
the tradeoff curve approaches the analog case and therefore the 
better the picture quality. Figure 20 results from using a crispened 
Gaussian prefilter. Many other filter shapes were studied in an attempt 
to find the best edge busyness — rise time tradeoff curves. At the higher 
sampling rates (namely 1.856 MHz and above) the prefilter shape be- 
comes unimportant ; here, less prefiltering is required and the receiver 
rolloff filter becomes increasingly dominant. As the sampling rate is 
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reduced, two things are desired in filter shaping : rapid amplitude fall- 
off versus frequency and controlled overshoot in the time domain. The 
rapid falloff is required to control aliasing without removing significant 
video information and controlled overshoot is needed to present a pleas- 
ing picture, one without excessive ringing. The crispened Gaussian 
filter shape with 4-percent overshoots, as specified by eq. (1) with 
K = 0.5292 but with T a variable, yields a good compromise between 
these two factors. 

The results given in Fig. 20 show the effects of different sampling 
rates with a single quantizer. The transmission rate is proportional to 
the sampling rate and the number of bits used to code each sample. 
Therefore, in the extreme case, we are comparing the performance of 
an 8-Mb/s transmission system (2-MHz sampler-4-bit quantizer) 
with that of a 4-Mb/s transmission system (1-MHz sampler-4-bit 
quantizer). The more interesting tradeoff results from considering a 
specific transmission rate. For any given transmission rate, what is the 
best combination of prefilter, sampling rate, and quantizer which 
minimizes both edge busyness and average rise time? A higher sampling 
rate implies fewer quantizing levels which means less SIB but more 
QIB. A lower sampling rate implies more quantizing levels which 
means more SIB but less QIB. The prefilter is used to trade off edge 
busyness against picture resolution to yield the best subjective picture 
quality for each sampling rate-quantizer combination. 

To be specific, given the following three transmission rates* 

(i) 6. Mb/s 
(it) 7.5 Mb/s 
(m) 8. - 9. Mb/s 

what is the optimum combination of sampling rate and number of 
bits /sample? 

For the 6.-Mb/s rate, the first three coders listed in Table III are 
considered. In order to simplify the calculations, only the first per- 
formance measure, M4, slightly modified, is calculated. A 40-percent 
video transition is used with the 2.0-MHz system, a 50-percent transi- 
tion with the 1.5-MHz system, and a 60-percent transition with the 

* The 6.-Mb/s rate results from the simplest use of a T2 line. The effective bit rate 
of 7.5 Mb/s results from using 300 cells of storage to expand video information into 
the horizontal sync pulse interval with subsequent transmission over a T2 line. An 
effective bit rate of 8 to 9 Mb/s can be achieved over a T2 line by using about 20,000 
cells of storage (see Ref . 6) to use both the horizontal sync pulse interval and variable 
length coding. This variable length coding takes advantage of the statistical properties 
of the video signal. 
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Table III — Coder Characteristics 


Transmission Rate 


Sampling Rate 


Quantizer 


6.0 Mb/s 
6.0 Mb/a 
6.0 Mb/s 


2.0 MHz 
1.5 MHz 
1.2 MHz 


3-bit DPCM 
4-bit DPCM 
5-bit DPCM 


7.5 Mb/s 
7.5 Mb/s 
7.5 Mb/s 


1.9 MHz 
1.5 MHz 
1.2 MHz 


4-bit DPCM 
5-bit DPCM 
6-bit DPCM 


8.3 Mb/s 
9.0 Mb/s 

8.4 Mb/s 


1.9 MHz 
1.5 MHz 
1.2 MHz 


5-bit DPCM 
6-bit DPCM 
7-bit DPCM 



1.2-MHz system. This modification accounts for the longer sampling 
intervals encountered with the lower sampling rates. For each coder 
a wide range of prefilters was evaluated and the optimum prefilter 
chosen using the subjective results given in Section IV. The prefilter 
which results in an edge busyness — rise time tradeoff just below the 
knee of the curve was chosen. The three resulting systems are given by 
the upper curve of Fig. 21. For the effective bit rate of 7.5 Mb/s, the 
middle three coders listed in Table III are considered. Using similar 
optimization procedures, these three coders result in the three edge 
busyness — rise time points given by the middle curve of Fig. 21. For 
the 8. to 9.-Mb/s transmission rate, it is assumed that variable length 
coding will result in an extra bit per sample for each of the sampling 
rates used. Here the three coders listed in the bottom of Table III are 
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Fig. 21 — Edge busyness — rise time tradeoffs for fixed transmission rates. 
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considered. These coders result in the three edge busyness— rise time 
points given by the lower curve of Fig. 21. 

Examination of Fig. 21 leads to the following conclusions: 

(i) For intraframe coders operating at an effective bit rate of 6 
Mb/s (points 3b-2., 4b-1.5, and 5M.2 on Fig. 21), the edge 
busyness— rise time tradeoff is optimized by using a 1.5-MHz 
sampling rate and a 4-bit quantizer coupled with a Picture- 
phone-type prefilter 20 dB down at 1 MHz. 
(ii) For intraframe coders operating at an effective bit rate of 7.5 
Mb/s (points 4b-1.9, 5b-1.5, and 6b-1.2), the edge busyness- 
rise time tradeoff is optimized by using a sampling rate of 
either 1.5 or 1.9 MHz, with no significant advantage associated 
with either of the two sampling rates, 
(m) Only after an intraframe coder is operating at a bit rate higher 
than 7.5 Mb/s (points 5b-1.9, 6b-1.5, and 7b-1.2) does it become 
beneficial to sample at 2 MHz. 

VI. SUBJECTIVE EVALUATION OF PICTUREPHONE CODERS 

The main point indicated in Section V was that the optimum sam- 
pling rate is some function of the transmission rate. This point is most 
dramatically illustrated for a transmission rate of 6. Mb/s. Here it is 
predicted that a 4-bit-l. 5-MHz coder will outperform the Phase 
coder (3-bit-2.0-MHz). In order to check this point subjectively, the 
same 30 observers used in Section IV were given an AB test between 
the Phase coder and a coder consisting of a crispened Gaussian pre- 
filter 20 dB down at 1 MHz, a synchronized 1.512-MHz sampling clock, 
and the Phase 1 4-bit quantizer. Each observer was asked to pick the 
most pleasing coder for Mike and Karen and the most readable coder 
for the graph. A summary of the results for the Phase 0-4-bit-l. 5-MHz 
AB comparison is given below : 





Prefer Phase 
22 


Prefer 4-bit-l. 5-MHz 


Mike: 


8 


Karen : 


3 


27 


Graph : 


7 


23 


All: 


32 


58 



Overall, the 4-bit-l. 5-MHz coder was preferred over Phase by a 
margin of nearly 2:1, giving some justification for the two-factor 
tradeoff analysis. 
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In order to find out where the edge busyness — rise time impairments 
lie on an absolute quality scale, a series of unimpaired-impaired type 
comparisons were used to rate six different coders on a five-comment 
impairment scale. For each of the three camera sources, each observer 
made a comparison of all six coders to the unimpaired analog video 
signal and was asked if the impairment added by the coder was : 

(i) not noticeable (weight = 1) 

(ii) just noticeable (weight = 2) 

(Hi) noticeable but not objectionable (weight = 3) 

(iv) objectionable (weight = 4) 

(v) extremely objectionable (weight = 5). 

Six different coders were evaluated ; these were : 

(i) Phase 1 (1.856-MHz, 4-bit DPCM, 6-dB Gaussian pre- 

filter) 
(ii) Phase (2.0-MHz, 3-bit DPCM, 6-dB Gaussian prefilter) 
(Hi) the 4-bit— 1. 5-MHz coder used in the previous AB comparison 

with Phase 
(iv) a 2.0-MHz sample-and-hold system using a 6-dB crispened 

Gaussian prefilter (System 1 of Section IV) 
(v) a 1.5-MHz sample-and-hold system using a 20-dB crispened 

Gaussian prefilter (System 4 of Section IV) 
(vi) a 1.2-MHz sample-and-hold system using a 40-dB crispened 

Gaussian prefilter (System 9 of Section IV). 

The detailed results of the comment scale ratings in terms of means 
and standard deviations for Mike, Karen, the graph, and all of the re- 
sponses are listed in Table IV. When all 90 responses are lumped to- 
gether, the six coder means are given below (see the next to the last 
row of Table IV). 

Mean Comment Scale Rating 



2.0-MHz 


sample 


-and-hold : 


1.63 


1.5-MHz 


sample 


-and-hold : 


2.12 


1.2-MHz 


sample-and-hold : 


3.46 


Phase 1 : 






2.59 


Phase : 






3.31 


4-bit-1.5-MHz : 




3.30 



Here a lower mean comment scale rating is preferable, indicating a 
lower amount of impairment added by the coder. Note that although 
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Table IV — Comment Scale Results 



Coder : 
Prefilter: 
Samp. Rate: 
Quantizer : 


4b-1.5 
20 
1.5 
4 


Phase 
6 

2.0 
3 


Phase 1 
6 

1.9 
4 


2.0 S&H 
6 
2.0 


1.5 S&H 
20 
1.5 


1.2 S&H 
40 dB 
1.2 MHz 
— Bits 


Mike 
Mean: 
Standard 
Deviation : 


3.27 
0.96 


2.85 
0.78 


2.50 
0.86 


1.78 
0.79 


2.10 
0.61 


3.05 

0.78 


Karen 
Mean: 
Standard 
Deviation : 


3.23 
0.63 


3.38 
0.54 


2.48 
0.72 


1.28 
0.50 


1.60 
0.65 


3.32 
0.89 


Graph 
Mean: 
Standard 
Deviation : 


3.40 
0.85 


3.68 
0.66 


2.80 
0.88 


1.82 
0.76 


2.67 
1.06 


4.00 
0.96 


All 90 Responses 
Mean: 
Standard 
Deviation : 


3.30 
0.83 


3.31 
0.75 


2.59 
0.84 


1.63 
0.74 


2.12 
0.91 


3.46 
0.97 



on an AB test the 4-bit— 1. 5-MHz coder was chosen over the Phase 
coder by a margin of almost 2:1, when placed on a quality scale and 
separated in time, they are rated as being equal. In using these ratings 
it should be remembered that the three camera sources were picked to 
give all intraframe coders trouble. If Mike did not have a striped shirt 
or Karen a striped blouse, these tests would have shown very little 
difference between the coders and all would have been rated much more 
leniently. 

In Fig. 22, the six coders used in the above subjective tests are 
plotted on an edge busyness— rise time chart. A 40-percent step size 
is used for both the Phase and Phase 1 coders while a 50-percent 
step size is used for the 4-bit-l. 5-MHz coder. All sample-and-hold 
coder coordinates are independent of the step size due to lack of a 
quantizer. With the mean scale values indicated in parentheses, it be- 
comes apparent that the general edge busyness — rise time analysis is 
further substantiated. The closer the coordinates to the analog case 
the better, with almost equal weighting between the edge busyness 
and rise time axes, edge busyness being slightly more undesired than 
reduced rise time. (See Section IV where points just under the knee of 
each curve were chosen.) The two dashed circles with centers on the 
analog coordinates approximate equal-impairment contours with the 
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Fig. 22 — Comment scale values as a function of edge busyness and rise time. 



dashed 45-degree line giving equal weighting to both edge busyness 
and increased average rise time. 

Finally it is noted that only station sets using crispened Gaussian 
postfilters which are 20 dB down at 1 MHz have been assumed. Other 
postfilters were tried, including some in the Butterworth family, and 
similar results were obtained. Any postfilter which affects the analog 
system to only a minor degree is not expected to significantly change 
the results presented in this paper. 



VII. SUMMARY 

A description of the two main degradations inherent in present intra- 
frame video coders has been presented. Methods allowing a tradeoff 
between these two degradations, edge busyness and reduced rise time, 
have been described; and optimum coder composition, including pre- 
filtering, sampling rate, and quantizer structure, has been presented as 
a function of transmission rate. A subjective testing program has been 
carried out which substantiates the edge busyness — rise time evaluation 
of video coders and rates a number of intraframe coders on a five- 
comment impairment scale. 
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APPENDIX 

Edge Busyness — Rise Time Program 

Program Inputs: (i) Step Size [A] 

(ii) Prefilter Step Response [6(0] 

(Hi) Sampling Rate [/. = 1/T] 

(iv) Quantizer Structure. 

Program Computations : 

A.l For each of 20 uniform phasings: 

(*,,» = 1,20) = (i- 1)7720. (3) 

A. 1.1 Determine a series of sample-and-hold values for the system 
as given in Fig. 3. 

(Xt it j = 1, 10) = (A&O,- + T(j - 1)]). (4) 

A. 1.2 Transform sample-and-hold series from A. 1.1 into a series 
of values that would be transmitted over the digital channel. 

Y ix = Q(Xa) (5) 



(f, j ,j = 2,io) = (q[x - l r«]) 



(6) 



where Q(a) refers to the quantized value of a that can be 
calculated using the quantizer structure. All thresholds are 
located midway between the output levels. 
A. 1.3 Digitally add the quantized series from A. 1.2 as would be 
done by the decoder to result in the quantized version of the 
sample-and-hold values of A. 1.1. 

(z ij ,j= 1,10) =(pj ik y (7) 

A. 1.4 Since the series produced in A. 1.3 represents the amplitude 
of a series of rectangular pulses approximating the original 
input video signal, summing a series of appropriately ampli- 
fied and delayed Picture-phone receiver step responses as 
given by eq. (2) of the text will result in the video output of 
the Picturephone station set. 



10 



Z(t) = E Y ik slt - fr - (k - l)? 7 ]. (8) 



fc-i 
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A. 1.5 Use a search procedure on this video output to find the in- 
stants of time when the waveform is equal to 10, 50, and 90 
percent of the input step amplitude. 

A.2.1 Calculate the output 

Tr = -!- £ (Tm ~ T il0 ) (9) 

as the average 10- to 90-percent rise time. 
A. 2. 2 Calculate and output 

EB = max (7\- 60 ) - min (T i60 ) (10) 

t t 

as the edge busyness measure. 

The above procedure can be used to calculate the combined effects 
of SIB and QIB or the effect of SIB only, since replacing the quantizer 
with an identity equation leaves only the effects of SIB to be calcu- 
lated. It is not possible to calculate the effects of QIB only, since 
presampling is required for the quantizer. 

For all plots given in this paper, eq. (8) can be placed inside a DO 
loop to calculate the output waveform for a wide range of times. 
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